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Abstract 

Ensemble inequivalence has been observed in several systems. In particular 
it has been recently shown that negative specific heat can arise in the micro- 
canonical ensemble in the thermodynamic limit for systems with long-range 
interactions. We display a connection between such behaviour and a mean- 
field like structure of the partition function. Since short-range models cannot 
display this kind of behaviour, this strongly suggests that such systems are 
necessarily non-mean field in the sense indicated here. We illustrate our re- 
sults showing an application to the Blume-Emery-Griffiths model. We further 
show that a broad class of systems with non-integrable interactions are indeed 
of mean-field type in the sense specified, so that they are expected to display 
ensemble inequivalence as well as the peculiar behaviour described above in 
the microcanonical ensemble. 

PACS number(s): 05.20. Gg, 05.50. +q, 05.70.Fh, 64.60.-i 

I. INTRODUCTION 

Particle or spin systems for which the pairwise interaction potential decays at large 
distances with a power smaller than space dimension are called long-range or non-integrable. 
It has been suggested that at first order phase transitions such systems should display 
ensemble inequivalence also in the thermodynamic limit P]>0]- A few examples where this 
is explicitly shown, both analytically and numerically, have been published |],|3|-§. The 
specific heat, which is always positive in the canonical ensemble, may become negative in 
the microcanonical ensemble, and even temperature jumps when continuously varying the 
energy may appear. Negative specific heat was first observed in gravitational systems during 
the process of gravothermal collapse, but here the situation is made more complex by the 
singularity of the interaction at short distances (for a review, see Q). In this paper we 
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present a formal approach within which we can easily prove ensemble inequivalence, and 
all the unexpected features of the microcanonical ensemble naturally arise. The approach 
is based on the assumption, which we justify afterwards, that a sort of Landau free energy 
can be always introduced for long-range systems and that it is endowed of good analyticity 
properties. A preliminary version of these results has been shortly presented in a workshop 
proceedings ||. 



II. MEAN-FIELD AND ENSEMBLE INEQUIVALENCE 



In this section, we show the main result of this paper. We say that a system is of 
mean-field type if it satisfies the following condition: 

/oo 
exp[-N*(J3,m)]dm, (1) 
-oo 

where Zc(/3) is the canonical partition function and $?(/3,m) is a real analytic function of 
(3 and m. Note that we require this for finite values of N, but that we additionally require 
the function \l/(/5,m) to be independent of N. A weaker requirement might be that \1/ is, in 
some sense, well-behaved in the N —>■ oo limit as far as analyticity is concerned. However, 
we shall not pursue this line any further here. Note in particular that infinite range models 
always have this property, as follows from their solution using, for example, the Hubbard- 
Stratonovich transformation. 

From this, it follows almost immediately that ensemble inequivalence will in general 
occur. Indeed, from (jl]) one obtains the following expression for the phase space volume of 
the energy shell, which may be viewed as a kind of microcanonical partition function: 

Z M (e) = / P^ZoiiX), (2) 



■/-co 2tt 

where e is the energy per particle. Using analyticity to rotate the integration contour, this 
can be recast in the following form, after inverting the order of integration: 

/oo rioo rl\ 

dm expLV(Ae-*(A,m)]. (3) 

-co J—ioo Z7TZ 

One can now perform a saddle point integral to estimate the value of the A integral for 
large values of N. We argue that the dominant saddle point must lie on the real axis: If it 
were otherwise, we would, because of the reality properties of have two complex conju- 
gate dominant saddle points, and hence an oscillatory behaviour of the partition function. 
However, if the dominant behaviour of the partition function is oscillatory, it will generally 
take on at least some negative values, which is absurd. We may therefore limit ourselves to 
the consideration of real saddle points. Since the integration path is perpendicular to the 
real axis, however, the minima along the real A axis of the argument of the exponential will 
correspond to maxima when traversed along the A imaginary axis. We are therefore led to 
the following expression for the microcanonical partition function: 



'M{£) = I exp 



iVmin(Ae - \[>(A,m)) 



A 



exp 



iVmaxminfAe — ^(A, m)) 

m \ v " 



(4) 
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From this one immediately obtains for the entropy per particle 

Sm(c) — max min[Ae — ^(A, m))]. (5) 

This result can now be compared with the standard result for the canonical ensemble, which 
is obtained as follows: The free energy per particle is found, after a straightforward estimate 
of ([D via Laplace's method, to be given by 

F(fi) = p- 1 mmV(J3,m). (6) 

It is now readily seen, through standard thermodynamic identities, that the entropy is the 
Legendre transform of (3F{f3) with respect to (3. This then leads to the following expression 
for S c {e) 

S c (e) = mmmax[Ae - *(A,m))], (7) 

From these results, a few consequences are immediate: 

1. The two forms of the entropy need not be equal. Indeed, if we could prove that 
^(A, m) has a unique extremum under certain conditions, we could argue for equality. 
However, we know that this is not generally the case: Whenever a phase transition 
occurs, the function \l/(A,m) may have multiple extrema at least as a function of m, 
thus precluding any simple statements about the identity of the two entropies. 

2. Quite generally, however, one may say that 

S M {e) < S c (e). (8) 



This is a general property of such min-max combinations and is trivial to prove (see 
Appendix A). It is, however, satisfying, since it suggests that in some sense the mi- 
crocanonical entropy is a restricted entropy, and that the real reason why the two are 
different is that the microcanonical system cannot relax in certain ways, reaching the 
maximal possible entropy in the canonical ensemble. 

Let us now consider the behaviour of the function ^(A, m) at the equilibrium points 
(A*, m*) and (A^, m£j for the canonical and microcanonical problem respectively. For 
the canonical problem we find the following conditions for the point (A*, m*) to be the 
appropriate extremum 



3 2 ^ 
'dX 2 

dm 2 



< 

> 0. (9) 



These correspond quite naturally to the usual stability conditions for the thermody- 
namic potential \l/(A,m), which is concave with respect to A and convex with respect 
to m. However, if one goes through the same computation for the microcanonical 
ensemble, one finds 
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d 2 ^ 

"dX 2 

d 2 ^ (d 2 V/dmd\) 2 



< 

> 0. (10) 



dm 2 d 2 ^/d\ 2 

Note in particular how this implies that the second derivative of $ with respect to 
m can take either sign. In fact, one verifies that the above two conditions have the 
following geometric meaning: The first indicates that ^ is a local maximum in the 
A direction, and the second amounts to stating that the matrix of second derivatives 
of \I/ has negative determinant, that is, the extremum must be of saddle-point type. 
These conditions are clearly weaker than (|J), so that more extrema are allowed for the 
microcanonical ensemble than for the canonical. 

4. If we consider the failure of concavity of the entropy as a function of e, related to the 
findings concerning negative specific heats in certain systems, we again find that this 
can happen in the microcanonical case. Indeed, one sees immediately that this cannot 
occur in the canonical ensemble, since there the entropy is given as a minimum of 
linear functions, which is of necessity concave. On the other hand, the microcanonical 
expression yields a maximum over concave functions, which need not be concave. More 
specifically, an explicit evaluation of the second derivative of Sm(c) yields 

d 2 S M _d 2 ^ {d 2 ^/dmd\f 
de 2 ~ ~d>? d 2 V/dm 2 ' ^ ^ 

Again, since d 2 ^ /dm 2 can take both signs, one sees that the specific heat can do so 
as well. In fact, combining this result with (|10D, one finds that the sign of d 2 SM/de 2 is 
the opposite of that of d 2 ^ /dm 2 . This means that the specific heat is negative exactly 
when the value of m corresponding to microcanonical equilibrium is unstable from the 
point of view of the canonical ensemble. 

5. It has also been found that the temperature T can have a discontinuous dependence 
on e in the microcanonical ensemble. Again this readily follows from our formalism: In 
the canonical ensemble, one effects a Legendre transformation of the following function 

F{\) = max^(A,m). (12) 



Now F can have discontinuities in its derivative, but it follows from the analyticity 
of \l/ that it cannot have any straight line segments. From this follows through well- 
known considerations that the entropy can have straight line segments (corresponding 
to ordinary first-order phase transitions), but no jumps in its derivative. On the other 
hand, the microcanonical entropy 5jw( e ) arises as the maximum over a set of concave 
functions, none of which can have jumps in the derivative. This clearly cannot exclude 
such jumps in Sm(c), which have indeed been found in specific models. 

6. Finally, we may note that, since the equivalence between ensembles is rigorously proved 
for interactions which are of sufficiently short-range |9j , this argument strongly suggests 
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that such systems are by necessity non mean-field, that is, that such functions as \I/ 
do not exist for such systems, or, more precisely, do not have the required analyticity 
properties. The argument is not fully rigorous: As pointed out above, it is in principle 
possible to have such an analytic function and yet not to have any cases in which the 
two ensembles differ. This would in particular be the case if, for all physical parameter 
values, the function \l/ had only one extremum. However, it is certainly very suggestive. 



III. APPLICATION TO THE BLUME EMERY GRIFFITHS MODEL 

Let us show how the above approach works in the case of the Blume-Emery-Griffiths 



(BEG) model JTI|, which is a spin Hamiltonian on a lattice defined as follows: 

N j / N \ 2 

ff = AE^4 E^. , (13) 



1=1 



where the are spin variables taking the values 0, ±1. A and J are positive coupling 
constants. This model has a line of second order phase transitions, which terminates in 
a tricritical point and is followed by a line of first-order phase transitions. This model, 
since it is in the mean- field limit, can be solved exactly using the Hubbard-Stratonovich 
transformation, to yield a representation of the partition function of the form (]!]). One finds 

#(/3 ; m ) = JL t In (l + 2e _/3A cosh(/3 Jm)) . (14) 



2 

We now show that finding the extremum of Ae— \I/(A, m) with respect to A and m is equivalent 
to looking for the extrema of the entropy at given energy e in the microcanonical ensemble. 
Indeed, if one defines 

1 N 

Tfl = —— Si 

?=^XX (15) 

1=1 

one readily finds that the entropy and the energy of (|13|) are given by 

q + m q + m q-m q-m 
S M {e) = (1 - q) ln(l - q) + In + In 

e = Aq-^-. (16) 



If one now maximizes 



S M (e)+fM(e-Aq+^f), (17) 



one finds that this leads to the same values of /i and m as the maximization of Ae — ^(A, m) 
gives for A and m respectively. Further, the resulting values for the entropy are shown 
to be equal. These somewhat messy computations are performed in Appendix B. It has 
been recently shown that in this model all the peculiarities mentioned in Section 2 actually 
occur p. 
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IV. GENERAL BEHAVIOUR AT SECOND ORDER PHASE TRANSITIONS 

AND TRICRITICAL POINTS 



In the preceding Section, we have shown how, in the case of a specific model, our formal- 
ism was able to re-derive the usual microcanonical expressions. However, as these expressions 
are quite cumbersome, it is not possible to derive in a transparent and explicit way the exact 
nature of the various thermodynamic anomalies encountered there. In this Section, we shall 
pursue a different approach: We shall consider quite general systems, but limit ourselves to 
the region in the vicinity of a phase transition. Under these circumstances, one can perform 
a Taylor expansion of the function \1/ as a low-order polynomial. This then allows to carry 
out a complete study of the stability conditions. 

To apply our formalism as developed in Section 1, we require an approximate expression 
for the function 



G(X, m; e, A) = Ae - ^(A, m; A), (18) 

where A represents one or more additional parameters of the Hamiltonian. Since we are 
interested in the vicinity of phase transitions, we are in fact concentrating on those regions 
in which new stationary points of G appear or disappear. Such regions are found in the 
vicinity of the solutions of the following equation: condition 

det D 2 G(\, m; e, A) = 0, (19) 

where D 2 G means the matrix of second derivatives of G taken with respect to A and m, and 
the variables e and A are chosen so that A and m are the corresponding stationary values. 

Let us first look at the simplest case in which there are no additional parameters. If we 
further assume that the function G is symmetric in m with respect to change of sign, one 
obtains the following result: it is generically possible for two new minima to arise whenever 
the second derivative of G with respect to m vanishes at the origin. If we consider G near 
this point and measure all quantities with respect to the critical values, which we therefore 
set equal to zero, we obtain the following expression for G in the vicinity of the transition: 

r A 2 

G(A,m;e) = ^— + a 2 \m 2 - m 4 + \e, (20) 

where we have introduced an arbitrary numerical factor to make the coefficient of the m 4 
term equal to unity and the minus sign is required by the thermodynamic stability of the 
system. The other constants are typically of order one. New phenomena may indeed appear 
when these are of the same order as m, in which case, however, new parameters A should be 
introduced and the system should be embedded in one that has an appropriate multicritical 
point. 

One now needs to test for the microcanonical stability conditions (1Q[). The first one, 



which is also necessary for canonical stability, is equivalent to the positivity of cq. The 
second one is written in terms of G as 

det^ = det( *£?Lt) < °> < 21 > 
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and the equations expressing the stationarity of G with respect to variations of A and m are 
given by 

2a 2 \m - Am 3 = (22) 
c A + a 2 m 2 + e = 0. (23) 

The equation ( p2|) determines A once m is known, unless m is zero (paramagnetic phase), in 
which case ( |2"2"| ) is a tautology. In this latter case, the condition fl2"T| ) is equivalent to a 2 A < 0, 
since Co is always positive. However, this is just the condition that ensures the absence of 
non-zero real solutions of (^2[ so that we may say in summary that the paramagnetic phase 
is microcanonically stable only if there is no ferromagnetic solution. 
On the other hand, if m ^ 0, it follows that 

a 2 A = 2m 2 , (24) 

from which it is immediate that ( pip always holds good. Therefore, the ferromagnetic 
solution is always stable whenever it exists. For this reason, there is no possibility of ensemble 
inequivalence near a generic second-order phase transition. 

The issue of tricritical points is a bit more subtle, since in this case we may have indeed 
ensemble inequivalence. The simplest case, to which we limit ourselves, is that in which a 
single additional parameter A is introduced and one considers the case where the second 
and the fourth derivative of G vanish simultaneously when m — 0. We assume that this 
occurs at e = A = 0, and we choose the value of A which corresponds to the tricritical point 
to be zero as well. Again we assume G symmetric in m around zero. In this case the Taylor 
expansion is given by 

c A 2 

G(X, m; e, A) = -y- + (a 2 A + 6 2 A)m 2 + (a 4 A + 6 4 A)m 4 - m 6 + Ae. (25) 

The equations determining A and m are now 

(a 2 A + b 2 A) + 2(a 4 A + b 4 A)m 2 - 3m 4 = (26) 
coA + a 2 m 2 + a 4 m 4 + e = 0. (27) 

Here (|26|) holds only if m ^ 0, otherwise it should be discarded. From (|27|) one obtains 



A = - a2m2 + £ + 0(m 4 ). (28) 
c 



If one substitutes (|28|) into (p5|) , one obtains a term of order m 4 with prefactor — a 2 /co, which 
is of order one and is not compensated by any other term of similar order of magnitude. 
Thus we see that in the microcanonical ensemble the point corresponding to the canonical 
tricritical point is not tricritical any more. Rather, since the m 4 term has negative sign, 
we do not need the m 6 term for stabilization any more and we are led back to the case 
of second order transitions discussed above. From this fact it immediately follows that 
ensemble equivalence cannot hold near canonical tricritical points. On the other hand, there 
may be microcanonical tricritical points, in which various microcanonical equilibria merge 
in the same way as in a canonical tricritical point. The study of such points is reserved to 
a later study. 
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V. MODELS WITH LONG-RANGE INTERACTIONS 



In this Section, we show that a large class of models having non-integrable interactions, 
actually satisfy our criterion (Jl|) and can hence be expected to display the whole gamut of 
phenomena discussed in Section 2. For definiteness' sake we restrict ourselves to spin models 
on a lattice. Extensions to more general cases is presumably unproblematic. Consider the 
following Hamiltonian 



H 



sli 



L -(d- a )j2 s -^^ + j2v[ s (k) 



k.l 



\k-l\ a 



(29) 



Here the s are spins which run over a discrete set S, the indices k run over a c?-dimensional 
lattice and a is an exponent between zero and d. The normalization of the interaction by 
^-(d-a) g Uaran tees that the Hamiltonian is in fact extensive. Note that the BEG model 
treated above, see (|T3"D, corresponds to a special case of (£2^), in which a = 0. 

To evaluate the partition function, we divide the volume in a large number of cells, such 
that the following conditions are satisfied: first, the cells are large enough that a partition 
function involving only the one-body terms can be evaluated accurately using saddle-point 
techniques; second, the cell should be small enough with respect to the whole sample, so 
that the interaction between the spins of one cell is negligible compared to the interaction 
with all other cells. Due to the non-integrable nature of the interaction, this can always be 
achieved by making the sample sufficiently large. We denote by x, y the centers of the cells 
and introduce the coarse-grained variables: 



p(x) 



E s{k). 



(30) 



fcec(x) 



Here C(x) denotes the cell at x and v its volume. We further define ^(f3, p) for a system of 
volume v by 



s(k) 



-IX*) 



V 
k 



P 



exp 



-{3j2v(s(k) 

k 



(31) 



In order to express the partition function in terms of the coarse-grained variables, we need 
an expression for the interaction energy in terms of p(x). This is obtained by the following 
consideration: As a consequence of the normalization, the interaction between any spin and 
those of its own cell or of nearby cells can be neglected. Since one only needs to consider 
distant spins, it is sufficient to use the first term of a Taylor expansion for the energy, which 
leads to the expression 



\x — y\ a 



(32) 



From this one obtains the following expression for the partition function 



Eexp 



p(x) 



(3 ^ p{x)p(y) 



E 



\x 



(33) 
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Here the sum extends over all functions p(x), which have integral N and N is the total 
number of cells. This can now be evaluated straightforwardly using Laplace's method. The 
main contribution comes therefore from solutions of the equation 

»* (ft rfS)) = J M r . (34) 

dp y K " L d ~ a y\x-y\ a K ' 

We now show that the right-hand side is independent of x. From this follows that the 
dominant contributions come from constant density profiles. One can therefore replace the 
functional integration in (|33|) by an ordinary integration over the constant value of p(x) and 
one has cast the partition function in the form (|l|). 

To show the constancy of the right-hand side of (|34]) , it is enough to note the following: 
The difference between the right-hand side evaluated at x and evaluated at x + a contains 
two types of terms: First, contributions due to cells near x or x + a. These give a finite 
number of contributions of order one to the sum, and no contribution to the right-hand side 
itself, because of the normalization. Secondly, one has distant terms, the contribution of 
which is estimated by a Taylor expansion. However, it is clear that in leading order, these 
terms are identical for x and x + a, thus showing the result. 

VI. CONCLUSIONS AND PERSPECTIVES 

Having shown that the mean-field formalism analysed in Section 2 already contains all 
the peculiarities of ensemble inequivalence, a first future direction of research might be 
to obtain exact, approximate or even numerical solutions of well chosen models with slowly 
decaying interactions, of the type discussed in Section 5. The aim will be to test the behavior 
of physical quantities, like the specific heat, as the exponent of the decaying interaction is 
varied from long-range to short-range coupling, similarly to what has been done for coupled 
rotators models |[Tl]JT^|. A related issue is the study of the behaviour of finite systems as 



a function of the ratio between the system size and the range of interaction. A further 
direction of research has to do with non-equilibrium properties. For instance, one might 
ask what happens when two systems of comparable size, both having negative specific heat, 
are put in contact with each other. This removes the energy conservation constraint of the 
two systems. What is the behavior of the full system in this case? We might expect the 
microcanonical entropy to come closer to the canonical, although remaining bounded from 
above by it. This would yield the paradoxical result that the coupling of two systems with 
identical intensive parameters leads to an irreversible increase of entropy. 

Finally, we have not at all mentioned in this paper non-equilibrium dynamical properties 
of long-range systems, which might well be analysed in the context of the thermodynamic 
formalism here introduced. For instance, the coherent clustering behavior observed in the 
antiferromagnetic mean- field XY model ||13||, and recently found to be a non-equilibrium 



phenomenon |14| , might well find some explanation in the context of the current formalism. 
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APPENDIX A: 

In this appendix, we show quite generally the inequality 

maxmin f(x, y) < minmax f(x, y), (Al) 

from which (|8|) follows immediately. Assume that the two extrema are attained at the two 
points (xi,yi) and (£2,2/2) respectively, that is 

f(xi,yi) < f(xi,y) 

f(x 2 ,y 2 )>f(x,y 2 ) (A2) 

From this follows 

f(x h yx) < f(x l , y 2 ) < f(x 2 , y 2 ), (A3) 
which implies the inequality (|A1|) . 

APPENDIX B: 

Here we complete the proof of the equivalence between the two ways of computing the 
microcanonical entropy of the BEG model, by performing the missing computations. The 
equations arising from the minimization of ( |14|) are 

Jm 2 ^ — Ae _AA cosh JmX + Jme~ AX sinh JmX 
6 2~ + 1 + 2e~ AA cosh JmX ~ 

e~ AA sinh JmX 
- m + 2 l + 2e-AAcoshJmA = °' (B1) 

which are readily combined to yield 

e -AA gjjjjj j m \ 



m = 2- 



1 + 2e _AA cosh JmX 
Ae~ Ax m coth JmX = 0. (B2) 

On the other hand, finding the extrema of ( |TTD leads to the following conditions 

ln(l -q) + In + In - fiA = (B3) 

q + m , q — m . . 

In In + 2 J[im = (B4) 

e + - Aq = 0. (B5) 
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From ( |B4| ) one obtains 

q = m coth Jm\i (B6) 

If one now substitutes (|E6| ) into ( |H3| ) and (|B5|) , one obtains the equations (P2|), where A must 
be identified with \i. We therefore see that the two sets of equations are fully equivalent. The 
verification that the value of the entropy is the same in both cases is now straightforward 
algebra. 
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